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Abstract 

Extensible records were introduced by Mitchell Wand while studying 
type inference in a polymorphic X-calculus with record types. This paper 
describes a calculus with extensible records, F <: p, that can be translated 
into a simpler calculus, F <: , lacking any record primitives. Given 
independent axiomatizations of F K .p and F <. (the former being an 
extension of the latter) we show that the translation preserves typing, 
subtyping, and equality. 

F <: p can then be used as an expressive calculus of extensible records, 
either directly or to give meaning to yet other languages. We show that 
F <: p can express many of the standard benchmark examples that appear in 
the literature. 

Like other record calculi that have been proposed, F <: p has a rather 
complex set of rules but, unlike those other calculi, its rules are justified 
by a translation to a very simple calculus. We argue that thinking in terms 
of translations may help in simplifying and organizing the various record 
calculi that have been proposed, as well as in generating new ones. 
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1. Introduction 

Extensible records, and the associated notion of row variables, were introduced by 
Mitchell Wand while he was studying the problem of type inference in a polymorphic X- 
calculus with record types [Wand 87]; a row variable is a type variable ranging over the 
possible field-extensions of a record type. Many calculi of row variables have been 
produced since then [Jategaonkar Mitchell 88] [Remy 89] [Wand 89] [Harper Pierce 90] [Cardelli 
Mitchell 91], and many more can be imagined. As we try to increase the expressiveness of 
these calculi, the axiomatization techniques become more and more divergent and 
complex. To be able to compare and discuss these different calculi, we feel the need of 
some more fundamental framework. This paper suggests that a very simple calculus of 
subtyping can be used as a basis for studying much more complex calculi of extensible 
records. 

In the search for a unifying framework, we can adopt the following working 
hypothesis: every reasonable calculus of row variables should be reducible to a calculus 
without row variables, via a well-behaved translation. The purpose of this hypothesis is 
not to eliminate row variables completely, since the translated programs would become 
too verbose to be useful; the purpose is to gain insights in the study of calculi with row 
variables. Even if our working hypothesis turns out to be false, which it may well be, we 
will have distinguished the easier features that can be translated from the more complex 
ones that cannot. 

To carry out this plan, we need to fix a suitable target calculus for the translation. 
Since we are studying type variables, a likely choice would seem to be the second-order 
X-calculus (system F [Girard 71] [Reynolds 74]). To express the idea that the translation is 
well-behaved, we require some basic soundness properties such as the preservation of 
typing, subtyping, and equality relations. But, in order to preserve subtyping relations, we 
need to translate to a target calculus that still has a notion of subtyping; otherwise we 
would gain little insight about the complex subtyping relations induced by extensible 
records. For a similar reason, we are not interested in untyped target calculi, for which 
translations are easily obtainable. 

As target calculus we use therefore an extension of F with subtyping, called F K . (F- 
sub), which has been Studied recently [Curien 90] [Curien Ghelli 91] [Cardelli Martini Mitchell 
Scedrov 91]. The fact that a translation of extensible records into F <: is at all possible also 
gives us new evidence about the expressiveness of F <: , and reinforces our feeling that 
F <: can be regarded as a canonical calculus of subtyping. 

Before the main discussion, we briefly review the motivations that led to the notions 
of row variables and extensible records. 

In a calculus with records, a program may contain expressions like r.l where r denotes 
a record value and the label / denotes a field of that record; then the record selection r.l 
denotes the value of the field labeled / in record r. 
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Given the expression r.l we can infer that rhas a type of the form Rcd(l:A), that is, a 
record type having a field labeled / of type A ; the type A is to be determined later. Given 
another expression r.l' in the same program, we can then infer that r has a type of the 
form Rcd(l:A,l':A'), and so on. 

This form of typing, though, becomes insufficient when considering record updates. 
The expression r.l<—a denotes a record similar to r, except that the value of its / 
component is updated to a. Consider now the program: 

p = A(r) r.l<—a 

Assuming a: A, and for any type B, we can infer the typing: 

p: Rcd(l:B)->Rcd(l:A) 

Given a record value rcd(l=b,l'=b'), having two fields labeled / and /' with respective 
values b and b', we consider legal the expression p(rcd(l=b,l'=b')) because the argument 
has all the fields required by the type of p. This expression then receives the type 
Rcd(l.A), because of the typing of p above. Unfortunately, by this typing we have 
forgotten that the argument of p, and hence its result, has another component labeled /'. 
This is unsatisfactory. 

To capture the kind of polymorphism required by the record update operation, we 
introduce row variables. Record types are extended to the more general form 
Rcd(lj:Aj,..,l n :A n ,X), where X is a row variable intended to represent "all the other fields" 
of a given record type; in this case all the fields except the ones labeled l]..l n . We can 
then assign to the program p the more informative type: 

p: Rcd(l:B,X)^Rcd(l:A,X) 

Now, in p(rcd(l=b,l'=b',l"=b")), where b'.B' and b":B", the row variable X is bound to 
l':B',l":B" (a row type), producing the expected result type Rcd(l:A,l':B',l":B") by 
substitution of l':B',l":B" for X. 

In this form of type inference we must keep track of constraints on the row variables, 
such as the fact that X in the example above must not come to contain / components 
(otherwise we would have a duplicate label). These constraints can be made manifest by 
adopting a type system featuring explicit polymorphism; then program p receives the 
typing: 

p: V(Y) V(xf I) Rcd(l:Y,X)—>Rcd(l:A,X) 

Hereof/ means that X is undefined at label / (that is, X can be bound only to row types 
that have no / components). Appropriate types and rows must then be explicitly supplied 
as arguments top, as in: 

p(B )(l ':B ', I ":B ")(rcd(l=b, l'=b',l "=b ") ) 
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This is finally a satisfactory typing of p, although for practical reasons we may require 
some type inference to avoid writing down the type arguments (B ) and (l':B',l":B"). We 
do not discuss type inference here, which we consider as a pragmatic variation on the 
basic calculus. 

In Wand's original view, and in further developments [Remy 89] [Harper Pierce 90], row 
variables are type variables of a different kind. In contrast, in [Cardelli Mitchell 91] we 
studied an explicitly polymorphic type system where both row variables and type 
variables are instances of second-order type variables, therefore unifying the two 
concepts. In this paper we go back to the original view that row variables are separate, but 
we show that they can ultimately be expressed as ordinary type variables. 

In outline, this paper shows how a calculus with row variables, F <: p, can be 
represented in a simpler calculus without row variables, F < ., via a translation. Given 
independent axiomatizations of F <: p and F < . (the former being an extension of the latter) 
we prove that the translation is well-behaved, in that it preserves typing, subtyping, and 
equality. 

The paper is organized as follows. Sections 2 and 3 recall the definition of F < . and its 
expressive power (borrowing from [Cardelli Martini Mitchell Scedrov 91]). Section 4 gives the 
main intuitions of the encoding of extensible records in F <: . Section 5 describes F < .p. 
Section 6 gives the translation of F <: p into F <: , and finally section 7 shows that the 
translation is sound. 

Examples of the expressive power of F <: p and comparisons with other calculi are 
delayed until section 5.5. We show there that F < .p can express many of the standard 
benchmark examples that appear in the literature. We encourage readers to examine these 
examples whenever convenient. 

Readers who wish to learn about F <: p as a language of records but who are not 
interested in the translation into F <: , may confine themselves to sections 1, 2.0, 2.1, 2.2, 
5.0, 5.1, 5.2, 5.4, 5.5, and 8. 

2. System F <: 

In this section we describe the target calculus, F <: , for the translation that will follow. 
F <: can be translated in turn into a trivial extension of F called Fj [Breazu-Tannen Coquand 
Gunter Scedrov 89]. However, the known translations from F <: to Fj do not preserve 
subtyping in F <: [Martini 90]; this reinforces the point that translating to F <: is more 
informative than translating directly to F. 

F <: is obtained by extending F with a notion of subtyping (<:). This extension allows 
us to remain within a pure calculus. That is, we introduce neither the basic types nor the 
structured types normally associated with subtyping in programming languages. Instead, 
we show that these programming types can be obtained via encodings within the pure 
calculus. In particular, we can encode record types with their subtyping relations [Cardelli 
88]. 
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2.1 Syntax 

The syntax of F <: extends the syntax of F as follows. A new type constant Top 
denotes the supertype of all types. Second-order quantifiers acquire a subtype bound: 
V(X<:A)A' (bounded quantifiers [Cardelli Wegner 85]). Ordinary second-order quantifiers 
are recovered by setting the quantifier bound to Top; we use V(X)A for V(X<:Top)A. The 
syntax of values is extended by a constant top of type Top, and by a subtype bound on 
polymorphic functions, A(X<:A)a. We use MX)a for MX<:Top)a. 

Syntax 

A,B ::= Types 

X type variable 

Top the supertype of all types 

A— » B function space 

V(X<:A)B bounded quantification 

a,b ::= Values 

x value variable 

top canonical value of type Top 

X(x:A)b function 

b(a) application 

X(X< :A )b bounded type function 

b(A ) type application 

A subtyping judgment is added to F's judgments. Moreover, the equality judgment on 
values is made relative to a type; this is important since values in F <: can have many 
types, and two values may or may not be equivalent depending on the type those values 
are considered as possessing. 

Judgments 

h E env E is a well-formed environment 

E h A type A is a type 

E h A <: B A is a subtype of B 

E\- a : A a has type A 

E h a <-* b : A a and b are equal members of type A 

We use dom(E) for the set of variables defined by an environment E. 

As usual, we identify terms up to renaming of bound variables; that is, using 
C{X^DJ for the substitution of D for X in C : 

V(X<:A)B = V(Y<:A)B(X^YJ 
Mx:A)b = X(y:A)(b{x^yJ) 

l(X<:A)b = X(Y<:A)(b{X^YJ) 
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These identifications can be made directly on the syntax, that is, without knowing 
whether the terms involved are the product of formal derivations in the system. By 
adopting these identifications, we avoid the need for a type equality judgment. 

Environments, however, are not identified up to renaming of variables in their 
domains; environment variables are kept distinct by construction. A more formal 
approach would use de Bruijn indices for free and bound variables [deB 72]. 



2.2 Rules 

The inference rules of F <: are listed below; we now comment on their most 
interesting aspects. 

The subtyping judgment, E h A<:B, defines, for any E, a reflexive and transitive 
relation on types with a subsumption property: a member of a type is also a member of 
any supertype of that type. Every type is a subtype of Top. The function space operator 
— * is antimonotonic in its first argument and monotonic in its second. A bounded 
quantifier is antimonotonic in its bound and monotonic in its body. 

The rules for the typing judgment, E h a: A, are the same as the corresponding rules in 
F, except for the extension to bounded quantifiers. However, additional typing power is 
hidden in the subsumption rule, which for example allows a function to take an argument 
having a subtype of the function's input type. 

Most of the equivalence rules, E\- a<-^b:A, are unremarkable. They provide 
congruence over the syntax, and /3 and 77 equivalences. Two rules, however, stand out. 
The first, (Top collapse), states that any two terms are equivalent when "seen" at type 
Top. Since no operations are available on members of Top, all values are 
indistinguishable at that type; this fact will have many interesting consequences in the 
sequel. The second, (Eg appl2), is the congruence rule for polymorphic type application, 
giving general conditions under which two expressions b'(A') and b "(A ") are equivalent at 
a type C. This rule also has many intriguing consequences, but these will not be explored 
here. They are described in [Cardelli Martini Mitchell Scedrov 91]. 



Environments 

(Env0) (Envx) (EnvX) 

E\- A type x^dom(E) EVAtype X^dom(E) 

h 0 env h E,x:A env h E,X<:A env 



Types 



(TypeX) 

h E,X<:A,E' env 
E,X<:A,E' h X type 



(Type Top) 

h E env 
E h Top type 
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(Type -^) 
Eh A type Eh B type 

EhA^Btype 



(Type V) 

E,X<:A h B type 
E\- V(X<:A)Btype 



Subtypes 



(Sub refl) 

E\- A type 

Eh A <:A 

(SubX) 
h E,X<:A,E' env 

E,X<:A,E'\-X<:A 



(Sub -^) 
E\-A'<:A 



(Sub trans) 

E\-A<:B 



E\-B<:C 



E\-B<:B' 



EhA^B <:A'^B' 



Eh A <: C 

(Sub Top) 
E\- A type 

E\-A<: Top 

(Sub V) 

Eh A '<:A E,X<:A ' h B<:B ' 
EV- V(X<:A)B <: V(X<:A')B' 



Values 



( Subsumption ) 

Eha.A E\-A<:B 
Eh a: B 

(Val fun) 

E,x:A h b:B 
E h X(x:A)b : A—>B 



(Val fun!) 



E,X<:A h b:B 



(Valx) 
h E,x:A,E' env 

E,x:A,E' h x:A 

(Val appl) 

Eh b : A —>B Eh a:A 
Ehb(a) :B 



(Val top) 

h E env 
E h top : Top 



EhX(X<:A)b: V(X<:A)B 



(Val appl2) 

Ehb:V(X<:A)B Eh A'<:A 
Ehb(A') : B{X*-A'} 



Equivalence 

(Eq symm) 

Eh a^b : A 
Ehb^a:A 

(Eqx) 

E h x:A 



Ehx* 

(Eqfun) 



x : A 



(Eq trans) 

Eh a <-> b : A Eh b 



c:A 



E h a c : A 

(Eq collapse) 

Eh a : Top Eh b : Top 



E h a <-*■ b : Top 



E,x:Ah b^b' : B 



(Eq appl) 

Ehb^b' . 



A^B Eh a* 



EhX(x:A)b ^ X(x:A)b' : A^B 



Ehb(a) ^ b'(a') :B 
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(Eqfun2) 



E,X<:A h b<->b' : B 



(Eq appl2) 

EVb'^b" : V(X< :A )B E\- A \A "< :A 
E h BfX^A'j, B{X^A"J <: C 



E h l(X<:A)b l(X<:A)b' : V(X<:A)B 

(Eq Eta) 

E\-b ** b' : A^B y^dom(E) 



(Eq Eta2) 

E\- b V : V(X<:A )B Y^dom(E) 



E\-b'(A') ** b"(A") • C 



E h X(y:A)b(y) ** V : A^B 



E\- X(Y<:A)b(Y) V : V(X<:A)B 



(Eq Beta) 

E,x:A \- b <-> b' : B E\- a <-> a' : A 



(Eq Beta.2) 



E,X<:A \-b^ b':B E h A' <: A 



E h (Mx:A)b)(a) ^ bfx^a'J :B E\- (X(X<:A)b)(A') ^ b'fX^A'J : BfX^A'J 

This calculus was first extracted by Pierre-Louis Curien from the one in [Cardelli 
Wegner 85] and studied by him and Giorgio Ghelli [Curien Ghelli 91] under the name F<. The 
present F <: is a refinement of F<, achieved mostly by extending the (Eq appl2) rule. It is 
Studied in [Cardelli Martini Mitchell Scedrov 91]. 

The following derived rules will be needed later. Their proofs follow from the 
lemmas listed in section 5.3 for F < .p. (Those lemmas hold for F < . as well, when 
restricted to the syntax of F <: .) 

Lemma (subsumption equivalence) 

The subsumption rule extends to the equality judgment: 

(Eq subsumption) 

E\-a^a':A E\-A<:B 
EVa^ a' : B 

Lemma (domain restriction) 

Iff: A— then/is equivalent to its restriction f\ A , to a smaller domain A'<:A, when 
they are both seen at type A'— That is: 

(Eqfun') 

E\-A'<:A E\-B<:B' E,x:A h b<->b' : B 
E h X(x:A)b ^ X(x:A')b' : A'^B' 

Lemma (bound restriction) 

Iff: V(X<:A)B, then/is equivalent to its restriction f\ A , to a smaller bound A'<:A, 
when they are both seen at type V(X<:A')B. That is: 
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(EqfunT) 

E\-A'<:A E,X<:A'\- B<:B' E,X<:A h b<->b' : B 
E\-X(X<:A)b ^ X(X<:A')b' : V(X<:A')B' 

3. Basic encodings 

Since F <: is an extension of F, it can express all the standard encodings of algebraic 
data types that are possible in F [Bohm Berarducci 85]. However, it is not clear that anything 
of further interest can be obtained from the subtyping rules of F <: , which involve only an 
apparently useless type Top and the simple rules for — * and V. 

In this section we begin to show that we can in fact encode rich subtyping relations on 
familiar data structures. In section 4 the encodings become more involved; this increase 
in complexity then motivates the switch to an independently axiomatized system (F <: p) 
in section 5. 



3.1 Booleans 

In the sequel of section 3 we concentrate on inclusion of structured types, but for this 
to make sense we need to show that there are some non-trivial inclusions already at the 
level of basic types. We investigate here the type of booleans, and in the process we 
illustrate some interesting consequences of the F <: rules. 

Starting from the encoding of Church's booleans in F, we can define three subtypes of 
Bool as follows (cf. [Fairbairn 89]): 

Bool = V(A)A->A->A 
True = V(A)A-> Top —>A 
False = V(A)Top^A^A 
None = V(A) Top^Top^A 

where: 

None <: True, None <: False, True <: Bool, False <: Bool 

Looking at all the closed normal forms (that is, the elements) of these types, we have: 

true Bool : Bool = A(A) Mx:A) My: A) x 

false Bool : Bool ± A(A) A(x:A) My: A) y 

true Tme : True = MA) Mx:A) My.Top) x 

false False : False = MA) M^:Top) A(y:A) y 

We obtain four elements of type Bool; in addition to the usual two, true Bool and false Bool , 
the extra true Tme and false " Fa i se have type Bool by subsumption. However, we can show 
that true Bool and true Tme are provably equivalent at type Bool, by using the domain 
restriction lemma ((Eg fan'), section 2.2). 
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E,A<:Top,x:A,y:Top hx*-»x:A E\- A<:Top 



E,A<:Top,x:A h X(y:Top) x <-> X(y:A) x : A^A (Eqfun') 

E,A<:Top h X(x:A) My:Top) x ^ )jx:A) Xjy.A) x : A^A-^A 

E h X(A) X(x:A) Xjy.Top) x *-> TjA) )jx:A) X(y:A)x : V(A) A^A-^A 

E h true Tme ** true Bool : Bo ° l 

Similarly, we can show that E h false False *-* false Bool : Bool. Hence, there really are 
only two different values in Bool. 



3.2 Products 

The standard encoding for pairs in F already exhibits useful subtyping properties: 

AxB = V(C)(A^B^C)^C 

Since both A and B occur in monotonic positions in AxB (being twice on the left of an 
arrow), we obtain the expected monotonic inclusion of products as a derived rule: 

E\-A<:A' E\-B<:B' 
Eh AxB <:A'xB' 

The operations on pairs are defined, as usual, as: 

pair : V(A) V(B)A^B^AxB 

= A(A) A(B) l{a:A) l{b:B) X(C) A(f:A^B^C) f(a)(b) 
fst : V(A) V(B)AxB^A 

= A(A) A(B) l(c:AxB) c(A)(X(x:A)X(y:B)x) 
snd : V(A) V(B)AxB^B 

= A(A) A(B) X(cAxB) c(B)(l(x:A)l(y:B)y) 

We often use the following abbreviations, disambiguated by context: 

a,b = a^xgb = pair(A)(B)(a)(b) 

fst(c) = fst AxB (c) = fst(A)(B)(c) 
snd(c) = snd^xgic) = snd(A)(B)(c) 



3.3 Enumerations 

Enumeration types (that is, finite sets) form another collection of base types with 
interesting inclusion relations. We describe them here because they show an interesting 
use of the Top type, and hint at the encoding of tuples in the next section. 

The enumeration of zero elements can be defined as: 

N 0 ^ V(A)Top^A 
This type has no closed normal forms, hence no "elements". 
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The enumeration of one element is defined as: 

Nj = V(A)AxTop^A 

This type has just one closed normal form: 

onej.-Nj = MA) Mx.AxTop)fst(x) 

Moreover, Nq <: Nj because AxTop<:Top. 

The enumeration of two elements is defined as: 

N 2 = V(A)AxAXTop-»>A 

This type has the two closed normal forms: 

one 2 :N 2 = MA) M^-AxAxTop) fst(x) 
two 2 :N 2 = MA) Mx:AxAxTop)fst(snd(x)) 

Moreover,^ <: N 2 , and by subsumption: 

one j : N 2 

We find that N 2 has three elements. As for booleans, we can prove that two of these are 
equal in N 2 : 

h one j <-> one 2 : N 2 

At this point the pattern of enumeration types should be clear: 

N n = \/(A)Ax..xAxJop^A 

n times 

with N n <: N n+] , where N n has n distinct elements. 
3.4 Tuples 

A tuple type Tuple(Aj,...,A n ,C) denotes an iterated product type. Its last slot, C, can be 
filled with any type. When C is a type variable, we have an extensible tuple type. When it 
is Top, we have a simple tuple type. 

Tuple(C) = C 

Tuple(A p ...,A n ,Q ± AjX(...x(A n xC)..) n>l 
Hence we have: 

Tuple(Aj,...,A n ,Tuple(Bj,...,B m ,C)) = Tuple(A 1 ,...,A n ,B 1 ,...,B m ,C) 

with derived rule: 

E\-Aj<:Bj ... E\-A n <:B n E\~C<:D 
E\-Tuple(Aj,...,A n ,C) <: Tuple(B p ...,B n ,D) 

As a special case we obtain the rule for simple tuples: 
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E\-A 1 <:B 1 ... E\-A n <:B n EV- A n+1 type ... E\-A m type 



EV- Tuple(A p ...,A n ,...,A m ,Top) <: Tuple(B p ...,B n ,Top) 



For example: 

Tuple(A, B, Top) <: Tuple(A, Top) 

since A <: A,BxTop <: Top, and xis monotonic. 

We note here that the type Top assumes a very useful role, in allowing a longer tuple 
type to be a subtype of a shorter tuple type. The intuition is that a longer tuple value can 
always be regarded as a shorter tuple value, by "forgetting" the additional components, 
and this is possible since everything is forgotten in Top. 

For tuple values we have: 

tuple(c) = c 

tuple(a 1 ,...,a n ,c) = a p (...,(a n , c)..) n>l 
tuple(aj,...,a n ,tuple(b p ...,b m ,c)) = tuple(a 1 ,...,a n ,b p ...,b m ,c) 
with derived rules: 

E h a j : Aj ... E h a n : A n E\- a : A 



E\- tuple(aj,...,a n ,a) : Tuple(Aj,...,A n ,A) 

E\- d]<->bj : Aj ... E\- a n <-^b n : A n E\- a^b : A 
E\- tuple(aj,...,a n ,a) <-> tuple(bj,...,b n ,b) : Tuple(Aj,...,A n ,A) 

The basic tuple operations are: aLi, dropping the first i components of tuple a; and a.i, 
selecting the z'-th component of a. These are defined by iterating product operations; we 
use the abbreviations: 

aLi = aL A i = drop^A^a) = snd^a) 

a.i = a. A i = sel^A^a) = fst(aLi) 

i 

More precisely: 

drop 0 : V(A 0 )A 0 ^A 0 

± %A 0 )X(t:A 0 )t 
sel 0 : V(A 0 ) AgXTop-^Ag 

± MA 0 ) Mt:A 0 xTop)fst A()>{rop (drop 0 (A 0 xTop)(t)) 
dropj : V(Aj) TopxAj^Aj 

± X(A j) X(t:TopxA j) snd TopxA (drop 0 (TopxA j)(t)) 
selj : V(A 1 )Top>A 1 xTop^A 1 ° 

± KAj) Mt:Top>A ] ^rop)fst A ^ op (dwp ] (A ] xTop)(t)) 

etc... 

We obtain the derived rules: 
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E h a : Tuple(A 0 ,..,Aj.jA) E\- a : Tuple(A 0 ,..,Aj,A) 

E\- aii :A E\- a.i : A i 

E\- a 0 : Aq ... E h a^j : A^j E\- a : A E\-a 0 :A 0 ... E\~a i :A i E\- a : A 
E h tuple(a 0 ,...,a i _ 1 ,a)Li *-* a : A E\- tuple(a 0 ,...,a i ,a).i *-> a i : A i 

Example: 

letf: V(X<:Tuple(B,Top))Tuple(A,X)^Tuple(A,A,X) = 
X(X<:Tuple(B,Top)) X(t:Tuple(A,X)) tuple(t.O, t.O, til) 

f(Tuple(B,C,Top))(tuple(a,b,c,top)) <-> tuple(a,a,b,c,top) 
: Tuple(A,A,B,C,Top) 

We have now developed the necessary techniques for encoding record types; this is 
the subject of the next section. 

4. Records 

The general plan, carried out in later sections, is to axiomatize the rules for records 
independently, and then provide a translation (encoding) into a calculus without records. 
In this section we are a bit more informal, and we discuss the encoding of record types 
without first discussing their derived type rules. Some pathologies caused by this 
approach will disappear later. 

4.1 Simple records 

Let L be a countable set of labels, enumerated by a bijection ieL—»Nat. We indicate 
by / ■, with a superscript, the z-th label in this enumeration. Often we need to refer to a list 
of n distinct labels out of this enumeration; we then use subscripts, as in lj..l n . So we may 
have, for example, Ipl^h = l 5 M,l 17 - More precisely, lj..l n stands for l a ( 1 \..,l <T ( n ) for some 
injective o e l..n—>Nat. 

A record type has the form Rcd(lj.Aj, .., l n :A , C), where the final type C will 
normally be either Top or a type variable. Once the enumeration of the set of labels L is 
fixed, a record type is encoded as a tuple type where the record components are allocated 
to tuple slots as determined by the index of their labels. That is, the component of label /• 
is allocated to the z'-th tuple slot; the remaining slots are filled with Top "padding". For 
example: 

Rcd(l 2 :C, l°:A, D) ± Tuple(A, Top, C, D) 

Since record type components are canonically sorted under the encoding, two record 
types that differ only in the order of their components will be equal under the encoding. 
Hence we can consider record components as unordered. 
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As an artifact of the encoding, a missing record field of label I - is equivalent to a field 
Z ; - : Top. However, the type rules for these two situations will differ, and in the former 
case the extraction of the label / ; - will not be allowed. 

A record type whose final component is Top is called a simple record; one whose 
final component is a type variable, is called an extensible record, or simply a record. 
Only these two situations will be allowed by the type rules for records; for example, 
notice that Rcd(l°:A, Rcd(l ] :B, C)) is not very meaningful under the translation. 

From the encoding, we can derive the familiar rule for simple records [Cardelli 88]: 

E\-Aj<:Bj ... E \~A n <: B n Eh- A n+1 type ... Eh- A m type 
E h- Rcd(l 1 :A 1 ,.J n :A Te .J m :A„ e Top) <: Rcd(l i: B p ..,l n :B n ,Top) 

The conclusion holds because any additional field l k :A k (n<k<m) on the left of <: is 
absorbed either by the Top padding on the right, if i(l k )<max( l(lj)..l(l n )), or by the final 
Top, otherwise. For example: 

Rcd(l°:A, l*:B, l 2 :C, Top) = Tuple(A, B, C, Top) 
<: Tuple(Top, B, Top) = Rcd(l ] :B, Top) 

Record values are similarly encoded, for example: 

rcd(l 2 =c, l°=a, d) = tuple(a, top, c, d) 

from which we obtain the rules for simple records: 

E} ~ a l :A 1 - E ^ a n :A n 



Eh- rcd(lj=aj,..,l n =a n ,top) : Rcd(lj:A p ..,l n :A n ,Top) 

Eh-aj^a'j :Aj ... Eh- a n ^a' n : A n 

Eh- rcd(lj=aj,..,l n =a n ,top) <-> rcd(lj=a' j,..,l n =a' n ,top) : Rcd(l 1 :A p ..,l n :A n ,Top) 
Record selection is encoded as follows: 

r.h ± r.Kh) 
with the rule: 

Eh- r : Rcd(l:A,Top) 
Eh- r.l : A 

By subsumption, we have the following derived rules: 

Eh-aj.-Aj ... EVa n :A n ... EVa m :A m 
Eh- rcd(l j=a p ..,l n =a n ,..,l m =a m ,top) : Rcd(l 1 :A 1 ,..,l n :A n ,Top) 

Eh- aj<->bj : Aj ... E h a n ^b n : A R 
Eh-a n+1 :B n+1 ... Eh-g p :B p Eh- b n+1 : C n+1 ... E\-b q :C q 

Eh- rcd(lj=aj,..,l n =a n ,..,l p =a p ,top) <-> rcd(lj=bj,..,l n =b n ,..,l q =b q ,top) 
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: Rcd(l 1 :A 1 ,.J n :A re Top) 
E\-r: Rcd( lj :Aj,..., l n :A n , Top ) ieL.n 
E\-r.l i :A i 

The second rule above is particularly interesting. It expresses a form of observational 
equivalence: two records are equivalent at a given type if they coincide with the 
components that are observable at that type. Ultimately, this is because any two values 
are equivalent at type Top. 

An interesting question about simple records remains: what is the equivalent of the L 
operator on tuples? To answer this, we must turn to extensible records. 



4.2 Extensible records 

In the next section we fully axiomatize a system with row variables, F <: p. To 
understand that axiomatization better, it may be useful to have an idea of the translation 
into F <: that will follow. In this section we sketch the main ideas of that translation, but 
the reader can skip to section 5 at any point. 

As we have done with tuples, we would like to place a type variable at the end of a 
record to capture all the "additional" components. 

Tuple(A, B, C, X) X represents all the other tuple components 

Rcd(l°:A, l 2 :C, X) X represents all the other record components 

When translating these records into tuples, we see that, to achieve the desired effect, the 
final type variable must split into a set of type variables. (We use the symbol ~ to mean, 
informally, "translates to".) 

Rcd(l°:A, l 2 :C, X) ~ Tuple(A, X 1 , C, X 3 ) 

Here X cannot be bound to a single (record) type; it must be bound to a labeled 
collection of types that fills the slots X 1 andX- 3 exactly. We call these collections type 
rows, and X a row ( type) variable. 

Consider, for example: 

Rcd(l°:A, l 2 :C, l 4 :E, X) 

Here the row variable X can be instantiated only to a type row that does not contain 
components labeled 1°, I 2 , or f, since these are already accounted for. For example, X can 
be instantiated to the type row l ] :B, l 3 :D, Top. 

We express this constraint on the instantiations of X by saying that X must have kind 
"t flyl 2 ,! 4 ", which reads "... is undefined (exactly) at Z 0 ,/ 2 ,/ 4 " or "... does not cover 
(exactly) l°,l 2 ,t". 

A constrained row variable X is hence translated to a sequence of type variables 
with "gaps" at L; for example: 
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xt() * xo 

xf/ 2 ^x°,x 7 ,x 3 xti 1 ,? ^x°,x 3 

Therefore, the first step in extending F <: with row types is to allow constrained row 
variables in environments: 

E',xtL, E"\- ... 

Then, if xt L ~ Xj, ...,X n we translate: 

E',xt L h RcddfAj, l m :A m , X) type 

* X n h Tuple(B p B n+m _ p XJ type 

where the 5 ; are the X 1 ...X n _ 1 and the A j—A m , in the proper order. 

To manipulate type rows and row variables we introduce a new judgment form 
(described in detail in the next section): 

E\-R tL 

where R is a type row (including a row variable), and L is the set of labels that are not 
covered by the row R. In general we need to translate not just records, but rows, which 
may have missing components: 

(l°:A, l 2 :C, X)tl 1 ,lf t ~ A, - , C, X 3 , - , X 5 (a row missing 1 st and 4 th ). 

Once row variables are allowed in environments, they give rise naturally to 
quantifiers V(xt L), and binders X(xt L). These row quantifiers and row binders must 
decompose under translation into sequences of type quantifiers and type binders. For 
example, we have: 

Vixtl^RcdilhAtX) -^B 

~ V(X°) V(X 2 ) Tuple(X°,A,X 2 ) -* B 

We now come to the most important issue of the translation: matching the number of 
arguments of a row type function MX t L)a to the number of parameters in a row type 
application (A(XI L)a)(RI L). The application form for a function b: V(XtL)B will have 
the shape: 

b(Rt L) : B(X<—RJ for Rt L 

where B{X<—R} is a row substitution such that (for £ a row variable or Top): 

Rcd(lj:A p ...,l n :A n , X) {X*-(V 1 :B p ...,V m :B m $)} = 
Rcd(l 1 :A p ...,l n :A n ,V 1 :B 1 ,...,l m :B m ^) 

We have seen that the translations of V(XI L)B and L)b convert the single 
parameter XI L into a sequence of parameters whose length can depend only on L. We 
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call this length dL: the dimension of L. When translating an application b(Rt L) we must 
then produce a sequence of applications of size dL, irrespectively of the actual parameter 
R. This may require some regrouping of the components of an argument row R. For 
example: 

b(l 2 :A 2 , Y t 1 1 1 3 ) (where b: V(X t lH 3 )B and Y t lH 2 l 3 ~ Y°, Y 4 

~ b(Y°)(A 2 )(Y 4 ) and I 2 : A 2 , Y 1 1 1 1 3 % Y°,A 2 , Y 4 ) 

b(l 3 :A 3 , Y t 1 1 1 2 ) (where b: V(X t lH 2 )B and Y t lH 2 l 3 ~ Y°, Y 4 

~ b(Y°)(Tuple(A 3 , Y 4 ) ) and l 3 :A 3 , Y 1 1 1 1 2 ~ Y°,A 3 , Y 4 ) 

In the second case, b(Y°)(A 3 )(Y 4 ) would be wrong; we must group A 3 and Y 4 into 
Tuple(A 3 ,Y 4 ), to match the two parameters (X° and X 3 ) expected by b. For uniformity in 
the translation, we always take the last parameter to be a tuple (since Tuple(A)=A), so the 
first case above becomes: 

b(l 2 :A 2 ,Y UH 3 ) 

~ b(Y°)(A 2 )(Tuple(Y 4 )) 

In conclusion, we can say informally that row variables translate to rows of variables, 
row types to rows of types, row quantifiers to rows of quantifiers, row applications to 
rows of applications, etc. The main difficulty in the translation is to ensure that all these 
rows match properly. For this, the precise relation between a row R t L and its dimension 
dL, will be discussed in section 6. 

We now turn to a formal system based on the intuitions about the translation of 
records into tuples developed in this section. 

5. System F <: p 

We now extend F <: with records and row variables, as discussed in section 4; the 
resulting system is called F <: p. 

5.1 Syntax 

Types in F <: are augmented by the following: record types Rcd(R), where R is a row 
type that must be defined at all labels; row function types Rt L —> B from an input of row 
type R f L to an output of type B; and row variable quantifications V(Xi L)B, where L is a 
set of labels at which X is undefined 

A row type is either the constant Etc, standing for an "empty row" (more precisely, an 
unnamed extension of the current row type); a type variable X, standing for an extension 
of the current row type; or l:A,R, extending the row type R by a field of type A and label /. 

Values are augmented by the following: records rcd(r), where ris a row value defined 
at all labels; row functions X(x. .R f L)b accepting a row value for x of row type R f L; and 
row type functions A(XfL)b accepting a row type forX that is undefined at L. Record 
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selection a. / can be used on a record a that is defined at /. A row function b can be applied 
via b(rfL) to a row value r undefined at L. A row type function b can be instantiated via 
b(R f L) to a row type i? undefined at L. 

Finally, a row value is either the constant etc, standing for an "empty row" (or, an 
unnamed extension of the current row value); a row variable x; an extension l=a,r, 
extending row value r by a field of value a and label /; or a restriction a\L, producing a 
row value undefined at L from a record a. 



Syntax 



L ::- l h .., l n 

A,B ::= ... 
Rcd(R) 
Rt L—>B 
V(xtL)B 

R,S::= 
X 
Etc 
l:A,R 

a,b ::= ... 
rcd( r) 
a. I 

X(x.-.RtL)b 
b(rtL) 
X(XtL)b 
b(RtL) 

r,s ::= 

x 

etc 

l=a,r 

a\L 



Label set 

Types as in F <: , plus: 
record type 
row function space 
row quantification 

Row types 

row type variable 
empty row type 

row type R plus field A labeled / 

Values as in F <: , plus: 
record value 
record selection 
row value function 
row value application 
row type function 
row type application 

Row values 

row value variable 

empty row value 

row value rplus field a labeled / 

row value of record a without fields in L 



As discussed in section 2, we identify terms up to renaming of bound variables: 

V(xtL)B = V(YtL)B(X^YJ 
X(XtL)b = X(XfL)b{X^YJ 
Mx-:RtL)b = X(y.-.R1 ' L)b{x^y} 

Moreover, we identify rows up to reordering of labeled components: 

l:A,l':A',R = l':A',l:A,R 
l=a,l'=a',r = l'=a',l=a,r 
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and we identify terms up to any permutation L' of a label set L: 

V(XtL)B = V(XfL')B RtL^B = RtL'^B 

l{Xth)b = X(xtL')b b(RtL) = b(Rt L') 

Mx.:RtL)b = X(x.:RtL')b b(rtL) = b(rtL') 
aXL = dSL' 

Again, these identifications are legitimate because they depend only on the syntax of 
terms, and not on their derivations. 

Given the identification of label sets above, we adopt the following notational 
convention used in the inference rules: 

l.L ± {IJuL where feL 

We now add to F <: four judgments about rows, which all involve a set L at which the 
rows are undefined. 

Judgments 

Judgments as in F <: , plus: 
E\-pR I L R is a row type not covering L 

E hp r.R f L r has row type Rt L 

E\- P R<.. stL R is a subrow of S, both not covering L 

E hp r <-> r'. .R t L r is equal to r' at row type R t L 

It is important to notice that the L information is preserved exactly in F <: p 
derivations, in the sense that£h P tf tL=>E\-p$tL ' is never derivable for LtL' for any 
of the four judgments. Hence, when we say that a row is undefined at L, we always mean 
undefined exactly at L. 

5.2 Rules 

We indicate by hp the judgments in F <: p, to distinguish them from the judgments h in 
F <: . The rules of F <: p consist of a copy of the rules of F <: (with h replaced by hp) plus 
the ones listed below. We now briefly comment on the F <: p rules. 

A row type is formed by starting with a row variable X t L, or with a row Etc t L, and 
then prefixing fields I: A with leL, at each step discarding / from L. Note that Etc can be 
assumed to lack any set of labels to start with. Informally, we can imagine either that an 
element of Etc f L is a collection of n=#L empty slots that are later "filled in", or that an 
element of Etc t L is an infinite row with "gaps" corresponding to L, and with all the other 
components filled with an error value. 

A record type can be formed only from a complete row R t (), one lacking no labels. 
(We call R f () complete even though we have only finite information about the labels of 
R; for example, Etcf() is complete but entirely unknown.) This completeness 
requirement is probably not essential, but gives us a simpler calculus where record types 
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carry only positive information, while row variables carry only negative information 
[Harper Pierce 90] . 

The subrow judgment, E hp R <. . S t L, is mainly an auxiliary one used to define 
subtyping on records. According to this judgment, every row is a subrow of Etc; then we 
have componentwise subtyping on fields having the same label. Hence, a longer row 
ending in Etc is a subrow of a shorter row ending in Etc if their corresponding 
components are in subtype relation. Rows ending with the same type variables must have 
the same length (otherwise, assuming X f I, what could L be in E hp l:A,X 
Rows ending in distinct type variables are unrelated, since we have no information about 
the labeled types that may be substituted for the variables. 

Record values can be created only from complete rows, as discussed above. Given a 
record a : Rcd(l:A,R) we can select its I component by a.l : A. Moreover, given a record a 
: Rcd(lj:Aj..l n :A n ,R) we can extract a row aXL .-. Rt L from it by removing all the 
components with labels in L. 

In F <: any two values are equivalent in Top. Similarly, in F <: p any two row values 
are equivalent in Etc. 



Environments 

(Envx t L) 

E\- P R tL x?dom(E) 
hp E,x. .R t L env 



(EnvX tL) 
\-pEenv X^dom(E) 

hp E,xf L env 



Types 



(Type Red) 

E\-pR t_Q 
EVp Rcd(R) type 



(Type - tL) 

E\- P R tL E\- P Btype 
E\-pR t L—>B type 



(Type V tL) 

E,X t L hp B type 

E\- P V(XtL)B type 



Row types 

(TypeX) (Type Etc) 
hp E',xt L,E" env VpEenv 

E\ xt L,E" \-pXtL EVpEtcth 



(Type cons) 

EVpR tl.L EVpAtype 
EV P l:A,R tL 



Subtypes 



(Sub Red) 

E\- P R<. 



R'f() 



(Sub ->■ tL) (Sub V tL) 

E\-pR'<.-.R t L E\- P B<:B' E,xt L hp 5 <: B' type 



E\- P Rcd(R) <: Rcd(R') £h P Rt L^B <: R't L^B' E\- P V(xt L)B <: V(xt L)B' type 
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Subrows 



(Sub Rowrefl) 
E\- P R fL 

E\- P R<.. R tL 

(Sub Etc) 

E\- P R tL 

E\- P R<.. Etc tL 



(Sub Row trans) 

E\- P R<.:StL E\- P S<.:TtL 
E\- P R<.: T tL 

(Sub cons) 

E\- P A<:B E\- P R<: S tj.L 
E\- P l:A,R < l:B,S tL 



Values 



(Val red) 

E\- P r.:R t_Q 

Ehprcd(r) : Rcd(R) 

(Val fun tL) 

E,x. R tL \- P b :B 
EVp X(x.-.Rt L) b : Rt L^B 

(Valfun2 t L) 

E,xtL\- P b :B 
E\- P A(XtL)b: V(XtL)B 



(Valsel) 

E\- P a : Rcd(l:A,R) 



E\-p a.l : A 



(Val appl f L) 

EVpb :RtL^B E\- P r.-.R^L 
E\- P b(rtL):B 

(Valappl2 fL) 

E\-pb : V(XtL)B E\- P R tL 
EV P b(RtL) :B(X^RJ 



Row values 

(Row Subsumption) 

E\- P r.R tL E\- P R < S tL 
E\- P r.:S tL 



(Valx tL) 

\- P E',x.:RtL,E" env 
E',x.:RtL,E"\- P x.:R tL 



(Val etc) (Val cons) (Val restr) 

hp E env E hp r.R t l.L E hp a:A EV P a: Rcd(lj:Aj..l n :A n ,R) 

E hp etc .: Etc t L £ hp l=a,r .-. l:A,R t L £ hp a\lj..l n .-. R t lj..l n 



Value equivalence 



(Eq red) 



Eh or 



E hp rcd(r) ^ rcd(r') : Rcd(R) 
(Eqfun f L) 

E,x.R tL hp b<->b' : B 

E hp A(x..R tL)b ^ X(x.:R tL)b' :Rt L^B 



(Eq set) (EqEvalsel) 

E\-p a^a' : Rcd(l:A,R) Ehp r.R U E\- P a^a':A 
E\- P a.l *-* a'.l : A £ hp rcd(l=a,r).l *-* a' : A 



(Eq appl t L) 

E\-p b<->b' : Rt L—>B E hp r<->r'.-.R t L 
E\-p b(rtL) ^ b'(r'tL) :B 
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(Eqfun2 f L) 

E,xfL\-pb 



b':B 



(Eq appl2 f L) 

E\- P b^ b' : V(XtL)B E\-pR t L 



E\- P A(XtL)b^A(XtL)b': V(XtL)B 

b':B E\- P r*->r'.-.R tL 



(Eq Beta f L) 

E,x.-.RtL\-pb 



E hp b(R tL)**b'(Rt L) : B(X< 

(Eq Eta f L) 



-RJ 



E\- P b b' : Rt L^B ytdom(E) 



E hp (tyx.-.R tL)b)(r) ** b'{x^r'J : B 
b':B E\- P R tL 



b' : Rf L—*B 

EVpb^ V : V(XTL)B Y?dom(E) 

E\- P (A(Xt L)b)(Rt L)^b'(X^RJ : B(X^RJ £h P A(Yt L)b(Yt L)<->b' : V(Xt L)B 



(EqBeta2 t L) 

E,xfL\- P b 



EV P My.-.RfL)b(ytL) 

(EqEtal f L) 



Row value equivalence 

(Eq Row symm) 

E\- P r«-*s.:R tL 
E\- P s^r.:R tL 



(Eq Row trans) 

E\- P r^s.:R tL E\- P s**t-.R tL 



E\-p r 



t:. 



R tL 



(Eq Row Subsumption) 

E\-p r<->r'.-.R t L E\- P R<.-.stL 



( Eq Row collapse ) 

E\-pr.-.Etc tL E\-ps.-.Etc tL 



E\- 



p r< 



>r',S t L 

(Eq etc) 



E\-p r ■ 



Etc tL 



(Eq x tL) 

E\- P x.:R tL 

E\-p x <-> x ■■■ R t L E\-p etc 



hp E env 



etc 



(Eq restr) 

E hp a^a' : Rcd(l 1 :A J ..l n :A n ,R) 
E\- pt N 1 ..l H ~a\l 1 ..l H .:R tl^ 

Example derivations 

hp E env 

E\- P Etc tl 3 ,l 5 E\- P Atype 
E hp I 3 : A, Etc tl 5 E\- P B type 
E\- P l 5 :B,l 3 :A,Etc t() 
E hp Rcd(l 5 :B,l 3 :A,Etc) type 



( Eq cons ) 

E hp r*-+r'..R f l.L E hp a^a'.A 
-. Etc tL E hp l=a,r <^> l=a',r' .-. l:A,R tL 

(Eq Eval restr) 

E hp r++r'.:R [ l v .l n E hp a 1 :A 1 ... E hp a n :A n 
E\- P rcd(l 1 =a 1 ..l n =a n ,r)\l 1 ..l n <-> r' .: R *lj..l n 



\- P E,xtl 3 ,l5 env 

E,X 1 1 3 , l 5 \- P xt I 3 , 1 5 E\- P A type 
E,X 1 1 3 , 1 5 hp l 3 :A,X tl 5 Ej-pB type 
E,xtl 3 ,l 5 \- p l 5 :B,l 3 :A,X t_Q 
E,xtl 3 ,l 5 \- P Rcd(l 5 :B,l 3 :A,X) type 



5.3 Properties 

We now state some basic lemmas about the properties of F <: p derivations (and, 
implicitly, of F < . derivations). Unless otherwise noted, these are all proven by induction 
on the derivations; the proofs are long, but straightforward if done in the order indicated. 
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Notation 

Let ~& be any of 

Ctype, S tM, C<:C, S<.S' t M, c:C, s.-.S t M, c^c':C, s++s'.S t M 

Lemma (renaming) 

Let <^,fi,fi'> stand for either <X, Y, X<:D, Y<:D>, <X, Y, xt M, yl M>, 
<x, y, x:D, y:D>, or <x, y, x.-.Tt M, y.-.Tt M>. 
Assume £,'tdom(E,fi,E'). 

\- P E,J3,E' env => hp E,J3',E'{^g} env 

E,fi,E'h P $ E,fi\E{^}V P n^j 

Lemma (implied judgments 1) 

(■&/env) \-pE,Fenv => VpEenv 

E,F hp $ => \-pE env 
(env/type) \-pE,X<:D,E' env => EVpDtype 

hp E,x:D,E' env => E\-pD type 
(env/rowtype) \-p E,x. rT L,E' env => E\-pR^L 

Lemma (bound change) 

hp E,X<:D',E' env, EVpDtype \- P E,X<:D,E' env 

E,X<:D',E'\- P Ctype, EVpDtype => E,X<:D,E'\- P C type 
E,X<:D',E'\- P S fM, EVpDtype => E,X<:D,E' hp S t M 

Lemma (weakening) 

Let fi stand for either xt L, X<:D, x:D, or x:. TtL. 
Assume \~pE,fi env, andX,x^dom(E'); then 

hp E,E' env => hp E,fi,E' env 

E,E'\- P E,J3,E'\- P 
Assume \-pE,Fenv and dom(F)ndom(E')=0; then 

\-pE,E'env => hp E,F,E' env 

E,E' hp E,F,E' hp 

Lemma (implied judgments 2) 

(sub/type) E\- P C<:C => £ hp Ctype, Eh P C type 

(subrow/typerow) E\- P S<.-.S' t M => E\- P S t M, E\- P S' t M 

Lemma (bound weakening) 

Let <fi,fi'> stand for either 

<X<:D, X<:D'>, <x:D, x:D'>, or <x..rIl, x..R'fL>. 
Assume E hp D'<:D and E\- P R'<..R t L. 

\-pE,fi,E' env => hp E,fi',E' env 

E,J3,E'\- P $ E,J3',E'\- P $ 
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Lemma (type substitution) 

Assume E hp D'<:D; then 

\- P E,X<:D,E' env => hp E,E'{X^D'J env 

E,X<:D,E'\- P & => E,E'{X^D'J\- P ${X^D'J 
Assume E hp S ? M; then 

\- P E,xtM,E'env => hp E,E'{X^S} env 

E,xtM,E'\- P $ => E,E'{X^SJ\-p${X^SJ 

Lemma (value substitution) 

Assume E hp d:D; then 

\-pE,x:D,E' env => \-pE,E'env 

E,x:D,E'\-p # => E,E'\- P ${x^dj 
Assume E hp t.-.T IN; then 

hp E,x.-.TI N,E' env => \-pE,E'env 

E,x.-.Tt N,E' hp E,E'\- P ${x^t} 

Lemma (value strengthening) 

Assume x?FV($); then, for c«-»c':C 

\-pE,x:D,E' env => \-pE,E'env 

E,x:D,E'\- P E,E'\- P &{x^d} 

Assume x^FV(-&); then, for # ^ r<-^r'. .R t L 

hp E,x.:Tl N,E' env => \-pE,E' env 

E,x.-.Tt N,E' \- P & => E,E'\-p$ 

Lemma (implied judgments 3) 

(val/type) EVpc : C => EVpC type, 

(rowval/rowtype) E\- P s.-.S t M => E\- P S t M, 

(eq/val) E hp c^c' : C => £ hp c : C, EV P c' : C, 

(roweq/rowval) E\- P s^s'.:S tM => E\- P s.-.S t M, E\- P s'.-.S t M, 

Lemma (subsumption equivalence) 

E\- P c^c':C, E\- P C<:D => E\- P c^c':D 
Proof By subsumption and beta; see [Cardelli Martini Mitchell Scedrov 91] □ 

Lemma (implied judgments 4) 

(val/eq) E\-p c : C => E hp c<->c : C 

(rowval/roweq) E\- P s.-.SiM => E\-p s<->s. S t M 

Lemma (exchange) 

Let fi stand for either X<:D, Yf M, x:D, or x.-. Tt M. 
Let fi' stand for either X'<:D', Y'tM', x':D\ or x'.-.T't AT. 
Assume hp E,fi' env. 

hp E,fi,fi',E' env => hp E,fi',fi,E' env 

E,fiJ3',E' hp 0 => EJ3'J3,E' hp # 
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We can now show that an observational equivalence rule for records is derivable. 
This rule asserts that two record values are equal at a given type if all the equally-labeled 
fields that can be observed at that type are equal. 

Proposition (observational equivalence for records) 

E\-p aj<-^bj:Aj a ... a E\-p a n <-^b n :A n a E\-pr.-.R f lj..l n a E\-ps.-.S ' 

E\- P rcd(lj=aj,..,l n =a n ,r) rcd(lj=bj,..,l n =b n ,s) : Rcd(l 1 :A 1 ,..,l n :A n ,Etc) 

Proof 

Let L=lj..L. 

env (implied judgment) 
E\-pr.-.RtL => E\-pRtL (implied judgment) 

E\-p R f L E\- P R < Etc t L (Sub Etc) 

E\-pr.:R fL a E\- P R<.:Etc t L => EV P r.-.Etc t L (subsumption equiv.) 
VpEenv => E hp etc^etc.Etc ? L (Eq etc) 

E\-pr.-.Etc 1 L a E\-p etc.-.Etc t L => E\-p r<->etc..Etc t L (Eq Row collapse) 
E\-ps.-.stL => E\-p etc^s..Etc ? L (similarly) 

E\-p r<-> etc.Etc tL a E\-p etc<-^s..Etc t L => E\-p r<->s..Etc fL (Eq trans) 

E\-pr.-.R t lj..l n a E\-ps.-.S t lj-l n => E\-p r<-^s..Etc 'lj-l n (above) 

E\-p aj<->bj:Aj a ... a E\-p a n <-^b n :A n a E\-p r<->s..Etc Tlj»l n 

£ hp lj=aj,..,l n =a n ,r <-> lj=bj,..,l n =b n ,s .-. lj:Aj,..,l n :A n ,Etc t() (Eq Row cons) 
=> E\- P rcd(l 1 =a 1 ,..,l n =a n ,r)^rcd(l 1 =b 1 ,..,l n =b n ,s):Rcd(l 1 :A 1 ,..,l n :A n ,Etc) (Eq red) 



5.4 Some useful extensions 

In preparation for examples in the next section, we discuss some useful extensions of 
our system: recursive types, label-set variables, and definitions. These extensions are not 
treated in the formal part of the paper. 

5.4.1 Recursive types 

In order to introduce recursive types, we need to add type equivalence judgments to 
the system along with rales (omitted here) for making type equivalence into a congruence 
over the syntax: 

E hp A «-* B type A and B are equivalent types 

E\-pR^stL RmdS are equivalent row types 

A recursive type is, syntactically, a term jJ.(X)A where A is contractive in X (written 
A>X). This means that A*X, and if A=ji(Y)B then B>X. We immediately identify 
recursive types up to renaming of bound variables: 

H(X)A = y(Y)A{X^Y} 
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Then, the rules for recursive types [Amadio Cardelli 91] are: 

( unfold) 

E,X<:Top hp A type A>X £ hp ji(X)A type 



E hp y,(X)A type E hp ^(X)A ^ A(X^ii(X)A } type 

E,X<:Top \- P A^B type A>X B>X 
E hp fi(X)A ^ ia(X)B type 

(contract) 

E hp A ^ CjX^A} type EV P B^ CfX^BJ type OX 
E hp A *-* B type 

E hp n(X)A type E hp fi( Y)B type E, Y<:Top,X<:Y \- P A<:B 
E\- P y,(X)A <:^(Y)B 

A recursive value is, syntactically, a term /n(x:A)a, with the identification: 

jl(x:A)a = /j(y:A)a{x<—yJ 

The standard rules for recursive values are: 

E,x:A \-pa : A E,x:A hp a : A E,x:A hp a<->b : A 



E hp jl(x:A)a : A E hp fl(x:A)a <-*■ a{x^^l(x:A)aJ : A E hp jl(x:A)a <-*■ jl(x:A)b : A 
5.4.2 Label sets 

The next extension involves variables W ranging over sets of labels. We allow these 
in environments, under an assumption WHL that Wdoes not contain any of the labels in L. 

E\-pLH M E\-pLH M E\- P LH l.M 

E\-pMHL E\- P LH0 E\- P LHM 

hp E env £ hp LH l.M 



E\- P 0H lj...l n .0 E\-pl.LHM 
E\-pLH 0 W{dom(E) \- P E,WH L env 



\- P E,WHLenv E, W H L, E'\- P W H L 

The rules of F <: p that involve label sets L, are extended to require L#0, to make sure that 
L is well-formed. We do not define quantifiers or functions over label- set variables 
because we do not know how to translate them into F <: ; label-set variables will be used 
only in definitions. 

5.4.3 Definitions 

We now extend the system with various flavors of definitions. The simplest 
definitions are value and row value definitions (let's): 
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E hp a : A E,x:A \-pb : B 



E\-p a : A E,x:A \-pb : B 



E\-p let x : A = a in b : B E\-p let x : A = a in b *-* b{x<—a} : B 

E\-pr.:RtL E,x..R f L\-pb : B Ehpr.-.R^L E,x.-.Rt LVpb : B 

E\-p let x ■■■ R t L = r in b : B E\-p let x .-. R t L = r in b <-> b{x<—r} : B 

There are several kinds of type-level definitions (Let's); we may give a definition of 
either a type variable, a row type variable, or a label- set variable, in the scope of either a 
type, a row type, a value, a row value, or a label-set. 

To compress several cases into one, we use the abbreviations: 

X,9^are either type, row type, or label-set variables; 
J3,!B,C are either types, row type, or label sets; 

Sla,'B6,Cc are either values, row values, types, row types, or label-sets; 

pred is either : A, .-. R f L, type, t L, or ttL. 

e% is either <:A, t L, or ttL (we often omit <:Top); 

!Aa{X} means Xmay occur in Act, then !Aa{ stands for SlalX*—®} 

For type, row type, and label-set definitions, in various scopes, we have the rules: 

Let X= SI in &B{X} = Let X'= !A in b<B{X '} 

EhpSleKi EV P <Bb{!A} -prtd EVp !Ae%. E hp <B6{Sl} pred 

£ hp LetXe%.= SI in <B6{ X] pred E hp Let Xe %. = SA in <Bb{X] ** <B6{ SA} pud 

Note that, unlike value definitions, we do not require E,Xe%\-p ( Bb{X] pred, this might not 
be typeable on its own. 

We also introduce parametric type-level definitions, for example: 

LetX[y,Z] = Sl{y,Z} in ... XfBjXj]... ^%c 2 7... 

for which we omit the obvious but technically complicated definitions. 
Finally, we use top level declarations, in the following way: 

let x : A = a 

lety : B = b stands for let x : A = a in lety : B = b in c 

c 

and similarly for Let. 

We now have enough useful features, and we can turn to examples. 

5.5 Examples 

Many examples in this section are adapted from [Canning Cook Hill Olthoff Mitchell 89] 
[Harper Pierce 90] and [Cardelli Mitchell 91]. 
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We start with a list of standard test cases and compare them with other calculi. 

• Extracting a field from a record that is known to possess it. 

let select x : Rcd(x:Nat,Etc)^Nat = 
h(a:Rcd(x:Nat,Etc)) a.x 

select Jrcd(x=3,y=true, etc)) *-» 3 : Nat 

• Extracting a field from a record that is not known to possess it. 

This is a typing error in all the calculi that have been proposed. 

• Removing a field from a record that is known to possess it. 

let restrict x : V(Xt x) Rcd(x:Nat,X)^ ...X... = 

X(Xt x) X(a:Rcd(x:Nat,X)) ... aSx ... (in a row context) 

restrict x (y :Nat,Etc ' x)( rcd(x=3,y=true,etc)) 

• Removing a field from a record that is not known to possess it. 

This is the crucial feature in [Cardelli Mitchell 91]. It is not possible here because the 
translation (section 6) requires exact knowledge of the missing fields. 

• Adding a field to a record that is known not to possess it. 

Not applicable; all records are already "complete". However, we can add a field to a 
row that is known not to possess it: 

X(r.-.R tx.L) ... x=b,r ... (in a row context) 

• Adding a field to a record that is not known to possess it. 

Not applicable; all records are already "complete". Moreover, even for rows, "not 
knowing" is not a sufficient condition for adding a field. This operation is possible in 
[Wand 87], [Remy 89], and [Cardelli Mitchell 91]. 

• Updating afield of a record that is known to possess it. 

Although adding a field under these conditions is not possible because all records are 
"complete", there is no problem with updating. Note that type information about 
additional input fields is preserved. This example motivated the work [Cardelli Mitchell 91]. 

letreplace x : V(xtx) V(A) Rcd(x:Top,X)^A^Rcd(x:A,X) = 
X(Xtx) 1{A) Mr:Rcd(x:Top,X)) l{a:A) rcd(x=a,r\x) 

replace x (y:Bool,Etc ? x)( String)( rcd(x=3,y=true,etc))("str") 
<-> rcd(x="str",y=true,etc) : Rcd(x:String,y:Bool,Etc) 

A restricted version, called consistent updating, preserves the type of the field being 
updated. 

letupdate x : V(xtx) V(A) Rcd(x:A,X)^A^Rcd(x:A,X) = 
X(Xtx) 1{A) l(b:Rcd(x:A,X)) X(a:A) rcd(x=a,b\x) 
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An interesting example of update occurs when "moving" the x field of a point. In this 
case we want to preserve the type of the y field (whatever subtype of Int that may be) and 
all the additional fields. If the input type of the x field is 0..9 (a proper subtype of Int), the 
corresponding output type must be Int, otherwise we could exceed the range 0..9 for x. 

letmove x : V(Y<:Int) V(zt x,y) Rcd(x:Int,y:Y,Z) -> Rcd(x:Int,y:Y,Z) = 
X(Y<:Int) %zfx,y) A(p:Rcd(x:Int,y:Y,Z)) rcd(p.x+l,p\x) 

p:Rcd(x:0..9,y:0..9, c: Color, Etc ) 

move x (0..9)(c:Color,Etc)(p) : Rcd(x:Int,y:0..9,c:Color,Etc) 

A more challenging task is to update "deep" in a structure, while preserving all the 
type information of the input. Here it can be achieved as follows, for a second-level 
boolean update. 

let deep-update : 

V(xfx) V(Yty) Rcd(x:Rcd(y:Bool,Y),X)^Rcd(x:Rcd(y:Bool,Y),X) = 
X(Xtx) MYty) l{a:Rcd(x:Rcd(y:Bool,Y),X)) 
rcd( x=rcd( y=not(a.x.y),a. x\y ), a\x ) 

deepUpdate xy (z:Nat,Etc T x)(w:Nat,Etc t y)(rcd(x=rcd(y=true,w=3,etc),z=4,etc)) 
*-» rcd( x=rcd(y=false,w=3, etc ),z=4, etc ) 

: Rcd( x:Rcd(y:Bool,w:Nat, Etc ), z:Nat, Etc ) 

• Updating a field of a record that is not known to possess it. 

Again, "not knowing" is not a sufficient condition here. 

• Renaming. 

Renaming is not possible in general. Consider, for example, Rcd(x:A,X)— >Rcd(y:A,X); 
what would be the constraint on XI 

We now pass to standard examples of "class hierarchies" and "methods". We use 
parametric type definitions, explained in section 5.4, to model record type extension, as in 
[Harper Pierce 90]. This technique compensates, up to a point, for the lack of the type 
operations of [Cardelli Mitchell 91]. 

• Points and color points 

A point has components x:Int, y.Int, while a color point also has a component 
c: Color. The challenge is to define the ColorPoint type and values by reusing the Point 
type and values. Here we can reuse types in two steps by defining a parametric version of 
each type. (Similarly for values.) This is an instance of a powerful generator technique, 
widely employed in [Cook 89]. 

Let PointPlus[Zt x,y] = 
Rcd(x:Int, y.Int, Z) 



Page 30 



Let Point = 

PointPlus[Etc] ( = Rcd(x,y:Int, Etc) ) 

Let ColorPointPlus[Zf x,y,c] = 

PointPlus[c: Color, Z] ( = Rcd(x,y:Int, c: Color, Z) ) 
Let ColorPoint = 

ColorPointPlus[Etc] ( = Rcd(x,y:Int, c: Color, Etc) ) 

let originPlus: V(ZI x,y) zf x,y—>PointPlus[Z] = 
Pi(Zf x,y) A(z-'-Zf x,y) rcd(x=0, y=0, z) 

let origin : Point = 

originPlus(Etc t x,y)(etc ?x,y) 

let whiteOriginPlus : V(Zf x,y,c) zf x,y,c—>ColorPointPlus[Z] = 

X(Zt x,y,c) X(z^-Zt x,y,c) originPlus(c: Color, Z t x,y)(c=white, z t x,y) 

let whiteOrigin : ColorPoint = 

whiteOriginPlus( Etc t x,y, c)(etct x,y, c ) 

• Total orders 

Here we have a record type TO of total orders. The ordering is represented as a 
method leq: TO^Bool, that compares another element of TO to the self value. The type 
TO is then recursive in the input type of its only method. 

The definition of TO is done in three steps; first we introduce a generator with open 
recursion (the SW/type parameter), then a generator derived from it where the recursion is 
closed, and finally the actual type TO. In general, the last two steps are obtained 
uniformly from the first. This technique is a bit complex, but it should be seen as a 
standard way of translating a "class" written in some more amenable language. 

Let TOGenPlus[Self, xf leq] = 

Rcd(leq: Self—>Bool,X) 
LetTOPlus[Xtleq] = 

H(Self) TOGenPlus[Self, X] ( = tfSelf) Rcd(leq: Self->Bool, X)) 
Let TO = 

TOPlus[Etc ] ( = rfSelf) Rcd(leq: Self Bool, Etc ) ) 

Next we define the total order of Naturals (by reusing TOGenPlus), as: 

Let NatTOGenPlusl Self, XI leq,val,add] = 

TOGenPlus [Self, (vahNat, add: Self Self, X)] 

( = Rcd( leq :Self—> Bool, vahNat, add: Self '—> Self, X)) 
Let NatTOPlus[Xt leq,val,add] = 
iMSelf) NatTOGenPlus[Self, X] 

( = jA(Self) Rcd(leq:Self—>Bool, vahNat, add: Self— > Self, X)) 
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Let NatTO = 

NatTOPlus[Etc] 

( = n(Self) Rcd(leq:Self->Bool, vahNat, add: Self : -* Self, Etc)) 

let zero : NatTO = 

rcd(val=0, add= pother: NatTO) other, 
leq=A( other :NatTO) 0<other.val, etc) 

(The methods of zero are too specialized to be inherited; this problem can be amended by 
defining a value generator with open recursion and, for example, leq=X (other: NatTO) 
self. val<other. val.) 

We now discover that, although NatTO was obtained by adding components to TO, it 
is not a subtype of TO by the rules for recursive types. Hence we have the unpleasant 
situation that operations defined on TO may not apply to particular total orders. 

The solution is to define those operations on TOPlus instead of TO. (As pointed out in 
[Harper Pierce 90] this can be done even without F -bounded quantification [Canning Cook Hill 
Olthoff Mitchell 89] in a calculus of "negative information", such as F <: p.) We can say that 
NatTOPlus is a subclass of TOPlus [Cook 89]. 

letmin : V(xt leq) TOPlus [X] ^TOPlus [X] ^TOPlus [X] = 
XfXtleq) Ma: TOPlus[X]) X(b: TOPlus[X]) 
ifa.leq(b) then a else b 

We can then specialize min to NatTO: 

let minNat: NatTO ^ NatTO ^ NatTO = 

min(val:Nat, add: NatTO —> NatTO, Etc / leq) 

to see that this typechecks, compute: 

TOPlus [val: Nat, add: NatTO >-* NatTO, Etc] 

= H(Self) TOGenPlus[Self, (vahNat, add: NatTO ^ NatTO, Etc)] 

= fl(Self) Rcd(leq: Self-* Bool, val: Nat, add: NatTO -^NatTO, Etc) (SI) 

+* NatTO ((B) 

The step from formula SI to formula H proceeds as follows, using the rules for recursive 
types given in section 5.4. By unfolding, we have: 

SI <-* Rcd(leq: SA^Bool, vahNat, add: <B->% Etc) 
<B Rcd(leq: l B->Bool, vahNat, add: Etc) 

Consider the contractive context C[X]\ 

C[X] = Rcd(leq: X-^Bool, val: Nat, add: <B^% Etc) 

Then SI <-*■ C[Sl] and t B<-* C[ %]; hence ^<^!Sby the contract rule. 
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• Movables 

Following the three-step schema, we now give type definitions for "things that can be 
moved". For added flexibility, the first step defines a row type instead of a record type, 
using a label-set parameter (explained in section 5.4). 

Let MovableGenPlus[ Self, L Hmove, Xl move. L] tL = 

move: lnt—>lnt—>Self, X 
Let MovablePlus[X t move] = 

H(Self) Rcd(MovableGenPlus[Self, 0,X]) 

( = fl(Self) Rcd(move: Int->Int->Self, X)) 

Let Movable = 

MovablePlus[Etc] ( = \±{Self) Rcd(move: Int-^Int-^Self, Etc)) 

let translate : V(Xt move) MovablePlus[X]—>lnt—>lnt—>MovablePlus[X] = 
X(XI move) X(m:MovablePlus[X]) X(dx:Int) X(dy:Int) m.move(dx)(dy) 

We can see that in this case Movable is a rather useless type. The interesting 
definition is MovablePlus, which however must be instantiated before it can be used. 
Hence, we combine movables with points: 

Let PointPlus[ Zl x,y] = 

Rcd(x:Int, y:Int, Z) 
Let Point = 

PointPlus[Etc] ( = Rcd(x:Int, y.lnt, Etc)) 

Let MPointGenPlusl Self, XI x,y,move] = 

PointPlus[MovableGenPlus[Self, (x,y), X]] 

( = Rcd(x:Int, y.lnt, move: Int—>Int^>Self, X)) 
Let MPointPlus[X t x,y, move ] = 
H(Self) MPointGenPlusl Self, X] 

( = jA(Self) Rcd(x:Int, y.lnt, move: lnt—>lnt—>Self, X)) 

Let MPoint = 

MPointPlus[Etc] ( = jl(Self) Rcd(x:lnt, y.lnt, move: Int^lnt-^Self, Etc)) 

let move : V(Xt x,y,move) MPointPlus[X]^lnt->lnt^MPointPlus[X] = 
X(Zt x,y,move) X(self:MPointPlus[X] X(dx:lnt) X(dy:lnt) 
rcd(x=self.x+dx, y=self.y+dy, selfx,y) 

let mOrigin : MPoint = 

\x(s elf :MPoint) rcd(x=0, y=0, move=move(Etct x,y,move)(self), etc) 

translate(x:lnt,y:lnt,Etc t move)(mOrigin)( 1 )( 1 ) : MPoint 

Note that in MPointGenPlus we have successfully reused the definitions for both 
points and movables. Moreover, move can be inherited by subclasses (as opposed to 
subtypes) of MPointPlus, by defining appropriate generators. 
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• Concatenation 

Record concatenation can be handled by adapting a technique of Remy [Remy 91]. 
With an extra level of encoding, record concatenation can be modeled by function 
composition; in our system, this idea can be realized as follows. 

We first define segments, as extensible records parameterized by their potential 
extensions: 

Seg(l p A p .J n :A n ) ± V(ztl p J n )ztl p J^Rcd(l p A p .J n :A n ,Z) 
seg(lj=a p ..,l n =a n ) = A(Zf Ij.JJ X(z--ZI l p .l n ) rcd(l 1 =a p ..,l r =a n ,z) 

A field of a segment can be extracted by precipitating the segment to a record: 

s.lj = s(Etct lj..l n )(etct lj-l n )-li where s : Seg(l P A p ..,l n :A n ), ieL.n 

Then, given two segments with distinct sets of labels: 

s : Seg(l p A p .J n :A n ) = V(zt L..IJ zt l p .l n ^Rcd(l p A p .J n :A n ,Z) 

t : Seg(k p B p ..,k m :B m ) = V(Zfk p .k m ) zt k p .k m ^Rcd(k p B p ..,k m :B m ,Z) 

we can define their concatenation (//) as follows: 

Mzt h..i n kj..k m ) Mz.-.zt lj..l n kj..k m ) 
S (k 1 :B p ..,k nl :B nf Ztl 1 l n ) 
(t(l p Top,..,l n :Top,Z f k p .k m )(lj=top,..,l n =top,z * k p .k m )\l p .l n ) 

so that we have: 

silt: Seg(l p A p ..,l n :A n ,k p B p ..,k m :B m ) 

It would now be possible to axiomatize an extension of F <: p with segments and 
concatenation, and define a translation of this extended calculus into F <: p. 

6. Translation of F <: p into F <: 

In this section we define the promised translation from a calculus with rows to one 
without rows. The basic idea is that row variables, row types, row values, and row 
judgments become rows or sequences of, respectively, variables, types, values, and 
judgments. 

We start with some familiar notation from previous sections: 



Notation 

L the set of labels 

l : L—>Nat (a bijection) a fixed enumeration of labels 

f = l~*(i) the label whose index is i in the fixed enumeration 

L,M... finite sets of labels 

#5 size of a finite set 
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Next we define the set of indices of a set of labels, and its maximum index: 

Definition (indices and maximum index of a set of labels) 

iL ± (i(l)UeLJ = fiU^L} 

ifL ± max(lL), where iff} = -1 

Finite sets of labels L are used mostly in contexts like ? L, describing the labels a row 
lacks. If we need to talk about the labels a row has, we can consider the complement L-L. 
This, though, is an infinite set, and the part beyond ftL is uninteresting. Hence, it is 
natural to take its most interesting finite prefix, kL: 

Definition (finite complement prefix of a finite set of labels) 

kL ± fi\i<ifLAl^L} 

A central concept in the sequel is that of the dimension of (the tuple translation of) a 
row. Take any row that is undefined at L; that is, any row whose tuple translation 
sketched in section 4.2 has gaps at L. Then the labeled components to the right of the last 
gap (ItL) are contiguous, and they can be collected into a single tuple; we call the result a 
normal row. The dimension dL of any row that has gaps at L is then defined as the 
number of components of the corresponding normal row. We emphasize that for any row 
r. R t L or R f L, its dimension depends only on L, and not on the structure of r or R. 
Hence dL can be defined very simply as: 

Definition (dimension of a row undefined at L) 

dL ± #(kL)+1 

When adding a new item to a row, the row dimension changes depending on whether 
the new item fills the last gap of the row or not. In the former case, a whole set of 
components may be compacted in the final tuple and the dimension decreases; in the 
latter case, the dimension increases by one. The following lemma is formulated in terms 
of adding or removing a gap. 

Lemma (row dimension) 

For l^L, 

if i<1tLibsnd(l t L)=dL-l; 

if i> itL then d(l v L)=dL+(i-(§L+l)). 

We now need some notation for describing complex sequences and rows, and for this 
purpose we use a notation similar to set comprehension. For example, we use l (i\2<i<4) 
to denote the sequence 2,3,4 in this order; the idea is that the superscript index / is 
increased monotonically to generate the elements of the sequence. 

Notation (sequences) 

#(S) length of a sequence 

S,S' sequence concatenation 



Page 35 



l ((p(i)\<t>( i)) sequence comprehension; the sequence, generated by 

increasing i, whose elements are cp(i) for ieNat a <P(i). 

A row is a sequence of labeled elements, sorted by label index, of length greater than 
zero. The last element of a row is special; as discussed in section 4.2 this is the rest of the 
row. For bookkeeping purposes, we use the special label 13 for this last element, where q 
is intuitively the index of the beginning of the uninteresting part of the row (as we can see 
from the row structure lemma below). 

Notation (rows) 

A type row R is a sequence of the form: 

lj:Aj..l n :A n where n>l and I = L q for some q. 
A value row r is a sequence of the form: 

lj=aj..l n =a n where n>l and / = L q for some q. 

#(lj:Aj..l n :A n ) = n; #(l ] =a ] ..l n =a n ) = n size 

/■:A- e lj:Aj..l n :A n ; /-=a- e lj=aj..l n =a n membership (ieL.ri) 

I: A ~* R ± i ((P:A i )\(V:A i )=(l:A) a (l':A')eR) sorting (if l:B*R for any B) 
l=a^r A H(r=ct)\(li=d)=(l=a) a (i l =a l )er) sorting (if l=b*r for any b) 

We can now define some basic sequences and rows that will be used in the 
translation. All these have dimension dL. 

Definition (basic sequences and rows) 



VarSeq(X, t L) 
VarRow(X, t L) 
TopRow( t L) 

varSeq(x, ' L) 
varRow(x, t L) 
topRow( t L) 



A 
A 



A_ 

A 



(X i \ieKL),X^ L+1 
((P:X)\ieKO , (L U+1 :X^ L+1 ) 
((P:Top)\i6KL) , 0 L+1 :Top) 

((^UeKL) ,0 L+1 =x^ L+1 ) 
({l l =top)\ieKL} , (L U+1 =top) 



selRow(a, t L) ± H(V=a.i)\ie kL) , (L ® L+1 =ci_ 1tL+l ) 
Examples 

VarRow(X, f()) = L 0 :X° TopRow( t()) = L°:Top 

VarRow(X, t ft) = L 1 :X ] TopRow( t ft) = L 1 :Top 

VarRow(X, 1 1 1 ) = ft:X°, l 2 :X 2 TopRow( 1 1 1 ) = ft -.Top, L 2 :Top 

VarRow(X, t ft, I 2 ) = I 1 :X ] , L 3 :X 3 TopRow( t ft, I 2 ) = I 1 .- Top, L 3 : Top 

In defining the full translation, I - ], we need an auxiliary translation, i - t U, for 
converting row types R ? L, and row values r t L, into rows of types and values, 
respectively. The results of i - I Lb are unnormalized, in the sense that they may have a 
dimension greater than dL; that is, the final tupleable components of the results need not 
be grouped together into a tuple. This auxiliary translation refers back to the proper 
translation, I - 1, but for exposition purposes we present it first. 
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Definition (translation, part 1; auxiliary row translation) 

iX^O = VarRow(XjL) 

{Etc tti = TopRow(tL) 
U:A,R tU ± 1:1 Al ^ iR U.U 

1xtU = varRow(x, t L) 

ietc ttt = topRow(tL) 
U=a,r tU ± 1=1 al — ir U.U 
4a\lj..l n tU = selRow(laljL) 

Hence, for the base cases iX tU and {Etc t Lt of the row type translation, we 
produce rows of X's or Top's of size dL. For U:A,R t L) we first compute {R t l.U , which 
has an additional gap for /, and we sort 1:1 Al into the result. 

Similarly for the row value translation. In addition, ia\lj..l n t U produces a row of 
record selections; the idea here is that eliminating lj..l n from a is the same as selecting 
and reassembling all the other components of a. (The type rules will ensure (lj..l n )=L, if 
a\lj..l n ?L is well-typed.) 

Here is an example of the translation: 

aU<>,ii,i3,p> = 

ll-X 2 , 14 : X 4 , P:X 5 , U:X 7 (of size d(l°, I 1 , 1 3 , 1 6 )) 

uihAi, p :A 6 jX ) u°,m = 

I 1 .-A 1 , l 2 :X 2 , f:X 4 , l 5 :X 5 , t:A 6 , L 7 :X 7 (of size greater than d(l°,P)) 

Next, we provide a kind of normal form for row types l]:Aj..l n :A n ,%, based on the 
translations QR \L\ (under typing assumptions). As we have seen, the translation returns 
rows whose length (which depends both on L and lj..l n ) may exceed dL. The normal form 
reveals that the portion beyond dL-1 has in fact no gaps and therefore can be collected 
into a tuple to form a single dL th element. Similarly for value rows. 

Lemma (row structure) 

(1 ) Let R=lj:Aj.. l n :A n , E, where %=X or %=Etc. 
Assume E\-pR t L. 

Then {R / Zj has the following shape, for some B's: 
H(frff)\ieiL) ,H(V:ff)\1tL<j<q), (L q :B<l) 
with q = (1tL+l) + (d(lj..l n .L)+n - dL) (q > itL) 

(2) Let r=lj=aj..l n =a n , ^ where £=x, %=etc, or %=a\M. 
Assume EVpr .-. R t L. 

Then ir t L) has the following shape, for some b's: 
H{V=V)\ietD ,H(P=b>)\itL<j<q), (L q =b«) 
with q = (ftL+1) + (d(l 1 ..l re L)+n - dL) (q > ifL) 

Considering the previous example: 
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HU.-A 1 , l 6 :A 6 ,X) U°,m = 

(I 1 .-A 1 , P:X 2 ), (14 :X 4 , P:X 5 , P:A 6 ), L 7 :X 7 

of size d(l°,P)-l 

I tupleable, d(l°,l 3 ) th item 

Now we are ready for the full translation. The translation of the F < . fragment of F <: p 
is uninteresting, but we list it for completeness. 

Definition (translation, part 2; F <: fragment) 

Environments Values 

ffhp E envl = h IE! env IE hp a : A] ± [El h [al : [A] 

101^0 lxl±X 

[E,x:A] ± [E],x:[A] [topi ± top 

[E,X<:A1 ± [E1,X<:[A1 [X(x:A)bl ± X(x:[Al)[bl 

Types [b(a)l ± [bl([al) 

[E hp A typel ± [El h [Al type [X(X<:A)bl ± X(X<:lAl)lbl 

[Xl±X [b(A)l ± Ibl(lAl) 
[ Topi = Top Value equivalence 

[A->B]± IA1->IB1 [EV P a^ a' : Al ± [El h [al ^ [al : [Al 
[V(X<:A)B1^ V(X<:[A1)[B1 
Subtypes 

[E \- P A<:Bl ± [El h [Al <: [Bl 

Finally, we can give the translation of the proper F <: p judgments and terms. An F <: p 
judgment E hp -& t L becomes a sequence of size dL of F <: judgments. A row variable 
xt L in an environment becomes a sequence of dL type variables. The domain of row 
function space R f L—>B becomes a sequence of dL domains; similarly for X(x..R\ L), 
with b(rlL) becoming a sequence of dL applications. A row quantifier V(XI L) becomes 
a nesting of dL type quantifiers; similarly for an abstraction X(xt L), with b(RtL) 
becoming a nesting of dL type applications. Record types and values are translated by 
applying i - t L) to the respective rows, and then normalizing the results to size dL. 

Definition (translation, part 3; F <: p proper) 

Environments (continued) 

[E,xtLl ± letX b .X dL =VarSeq(X,tL) in [El,X p ..,X dL 

[E,X.:RtLl = 

let xj..xgi= varSeq(x, ' L) and Aj.Ajl=[R t Ll in [El,xj:Aj..x^i:Ag L 
Types (continued) 

[Rcd(R)]± [R t()l 

[R t L—>Bl = letAj..A dL =[R t Ll in Aj^..^A dL ^[Bl 
[V(XtL)Bl ± letXj..X dL = VarSeq(X, t L) in V(Xj).. V(X dL )[Bl 
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Type rows 

IE \- P R flj ^ letAj.Afc = [R f Ll in [E]\-Aj type ... [El\~A dL type 
IR t Ll = let(l 1 :A 1 ..l dL :A dL ..l n :A n ) =iR fU in A 1 ..A dL _ 1 ,Tuple(A dL ..A n ) 
Subrows 

[E\- P R<.-.S fL]± 

letAj..A dL = lR tU and B^.B^ = [S t Ll 

in [El V- Aj <: Bj ... [El h A dL <: B^ 
Values (continued) 
[rcd(r)l± lrt()l 
la.ll ± lall(l) 
[X(x.:RtL)bl = 

let xj..xj L =varSeq(x, t L) and Aj..Aj l =[R t Ll in X(xj.Aj )..X(xg L :A^ L ) lb! 
ib(rtL)] = leta 1 ..a dL =[r t Ll in ibl(aj)..(a dL ) 
[X(XtL)bl = letX h .X dL = VarSeq(XjL) in X(Xj)..X(X dL )[bl 
Eb(RtL)l± let A b .A dL =[R t Ll in lbl(Aj)..(A dI ) 
Value rows 

IE \- P r.-.R tLl± 

letaj..aj L = lr t Ll andAj..Aj L = [R t Ll in [El h aj.Aj ... [El h ag L :Ag L 
[r t Ll = let (lj=aj..lg L =ag L ..l n =a n ) =ir t Lfr in aj..ag L _j,tuple(ag L ..a n ) 
Value row equivalence 
lE\-pr**r'.:R t Ll ± 

let aj..agi = [r f Ll and a'j..a'gi = lr' fLlandAj..A^ = [R t Ll 

in [El h aj<-^a'j.Aj ... [El h ag L <->a' g L :Ag L 

Examples 

[Mx--(l°:A,Etc)tl 1 ) rcd(l 1 =b,x)l = 

Mx°:lAl) Mx 2 :Top) tuple(x° ,[bl,x 2 ) 
[X(x.-.(l 2 :A,Etc) 1 1 1 ) rcdO^kx)! = 

Mx°: Top) k{x 2 :Tuple([Al,Top)) tuple(x°,[bl,x 2 ) 

[X(Xf l 0 ,?) X(x.:(l°:A,X) t 1 1 ) rcdO^kx)! = 

MX 2 ) X(x°:lAl) l(x 2 :X 2 ) tuple(x° ,[bl,x 2 ) 
[X(Xt I 1 , 1 2 ) Mx-:(l 2 :A,X) 1 1 1 ) rcdO^kx)! = 

MX 0 ) MX 3 ) Mx°: X°) Mx 2 :Tuple(\AlX 3 )) tuple(x° ,lb\x 2 ) 

Using the row structure lemma, we can now show that the translation is well-defined, 
provided that the translated terms are well-typed. 

Lemma (translation dimensions) 

#(VarSeq(X, t L)) = #(VarRow(X, t L)) = #(TopRow( t L)) 

= #(varSeq(x, t L)) = #(topRow{ t L)) = #(varRow(x, t L)) = #(selRow(a, t L)) 
= dL 
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If E\- P R tL then #(iR tU)>dL. 
If E\- P R tL then #(iR t Ll) = dL. 
If E\-pr.:R tL then #(ir fU)>dL. 
If E\- P r. R tL then #(lr t Ll) = dL. 

Lemma (good translation) 

If a judgment J is derivable, then the translation Ul is well-defined. 
That is, all the assumptions made in the translation about sizes of rows, 
are justified. 

The row structure lemma is also the key to the following row analysis lemma, which 
is then used in the proof of all the technical lemmas in the next section. The row analysis 
lemma describes in detail what happens when a single element is added to a row, or 
removed from it. 

Lemma (row analysis) 

(1) Assume E hp l:A,R t L. 

Let Bj..B dlL = [R tl.Ll and Cj..C dL = ll:A,R t Ll 
If l(l)< itL then d(l.L)=dL-l, and: 

C 1 ..C dL = B 1 ..B k _ 1 ,lAlB k .B dL _ 1 
where k = #{i\ie kL a i<i(l)J <dL-l. 
If l(l)> itL then d(l.L)=dL+(i(l)-( Hl+1)), and: 

C l- C dL-l = B l- B dL-l c dL = T wle(B dL ..B aL _j,[Al,B dLL ) 
where d(l.L) > dL. 

(2) Assume E hp l=a,r.-.R t L. 

Let bj..bgi L = lr t l.Ll and Cj..cg L = ll=a,r * Ll 
If i(l)< ifL then d(l.L)=dL-l, and- 

Cp-CdL = b 1 ..b k _ 1 ,lal,b k .b dL .i 
where k = #{i\ie kL a i<i(l)J <dL-l. 
If l(l)> itL then d(l.L)=dL+(i(l)-( itL+1)), and: 

c l- c dL-l = b l- b dL-l c dL = tWle(b dL ..b dLL _jJal,b dLL ) 
where d(l.L) > dL. 

(3) Assume E hp a\L. .R t L. 

Let bj..bji L = la\l.L t l.Ll and Cj..c^ L = laXL f Ll 
If l(l)< itL then d(l.L)=dL-l, and: 
Cj..c dL = b 1 ..b k _ 1 ,la.ll,b k .b dL . 1 
where k = #{i\ie kL a i<i(l)J <dL-l. 
If l(l)> itL then d(l.L)=dL+(i(l)-( itL+1)), and: 

c l- c dL-l = b i- b dL-l c dL = tupM b dL~ b dlL-l^ a - l ^> b dll) 
where d(l.L) > dL. 
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7. The translation preserves derivations 

In this section we show that the translation from hp to h is sound. That is, if a judg- 
ment J is derivable in hp, then all the judgments in the sequence [Jl are derivable in h. 

The following group of lemmas is used in the hardest cases of the soundness proof. 
These lemmas are complicated by the fact that the translations are well-defined only 
under typing assumptions. First we have lemmas regarding rows; they have the structure 
of some of the inference rules, but concern the translation of those rules. 

Lemma (soundness of row inference rules) 

(type row cons) 

Assume E\-pR' t l.L and £ hp A type. 

If lEhpR' tl.Ll and lEVpAtypel then ff£h P l:A,R' t Ll 

(sub row cons) 

Assume E \- P A<:B and E hp R'<S' t l.L. 

If IE \- P A<:B]l and IE \- P R'<.:S' t l.L] then [E\- P l:A,R' <.-. l:B,S' t Ll 

(row cons) 

Assume E hp a: A and E hp r.R t l.L. 

If IE hp a:Al and IE hp r.R tl.Ll then IE\- P l=a,r.-.l:A,R t Ll 

(selection) 

Assume E hp a : Rcd(l:A,S). 

If IE hp a : Rcd(l:A,S)ti then IE hp a.l : Al 

(restriction) 

Assume E hp a\L .-. l:A,S t L. 

If [E\- P a\L.: l:A,S t Ll then lEVp aXLL .-. S tl.Ll 
(eq-cons) 

Assume £ hp r <-> r' .: R * l.L, and E hp a<->a':A. 
If IE\- P r**r'.-.R tl.Ll and lE\- P a**a':A] 
then lEVp l=a,r l= a ',r' .-. l:A,R t Ll 

(eq-selection) 

Assume E hp a<->a' : Rcd(l:A,S). 

If IE hp a^a' : Rcd(l:A,S)l then IE hp a.l — a'./ : Al. 

(eval-selection) 

Assume E hp r.-.i? 1 1 and £ hp a<->a':A. 

If ff£ hp r. i? 1 11 and f£ hp a^a'.Al then ff£ hp rcd(l=a,r).l ** a' : Al 

(eq-restriction) 

Assume E hp a\L<->a\L .-. l:A,S t L. 

If iE\- P aXL^a\L.: l:A,S t Ll then iE\- P aM.L^aM.L .-. 5 f/.Zj. 
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( eval-restriction ) 

(1) Assume E\- P r ** r' .-. R t (). 

If lEVpr^ r' .-. R t ()] then [E hp rcd(r)\ «-» r' .-. R t()l 

(2) Assume E\-p rcd(l=a,r)\L l=a,r' .-. l:A,R t L. 
If IE hp rcd(l=a,r)\L l=a,r' .-. /.A,fl f Zj 
then f£h P rcd(l=a,r)\l.L ^ r' .-. 7? f /.Zj. 

Next we have substitution lemmas for all possible combinations of variables and 
terms. 

Lemma (soundness of substitution) 

(type in type) 

Assume E \-pA'<:A and E,X<:A,E' hp B type. 
Then IB{X^A'}1 is well-defined 
Then lBl{X*-lA'l} = lBpt—A'}l 

(type in row-type) 

Assume E\- P A'<:A and E,X<:A,E' hp 5 ?M 
Then lS(X^A'} t Ml is well-defined. 
Let B 7 ..B^ M = lStM] and C ; ..Q M = lS{X^A'} t Ml 
Then BjfX^lAl} = C 1 ... B dM {X^lA'l} = C dM . 

(row-type in type) 

Assume E\-pR IL and E,XI L,E' hp B type. 
Then [BfX^RJl is well-defined. 
LetX/.J^ = VarSeq(XjL) and A 7 ..A A = fffl f Zj 
Then iB]{X 1 ^A 1 }..{X dL ^A dL } = IB{X*-R}1 

(row-type in row-type) 

Assume E\- P R fL and E,xtL,E'\- P S t M. 

Then lS{X^R} t Ml is well-defined. 

LetXj..X dL = VarSeq(XjL) and A 7 ..A A = [R t Ll 

Let B r .B dM = IS tMl and C 1 ..C dM = lS{X<—R} t Ml 

Then B i {X 1 *-A 1 }..{X dL <*-A dL } = Q for / m i..o!M. 

(Xype in value) 

Assume £h P A'<.A and E,X<:A,E' \- P b : B 
Then IbfX^A'Jl is well-defined. 
Then !b]{X«-!A']J = lbpt—A'}l 

(type in row-value) 

Assume Eh P A'<.A and E,X<:A,E' hp 5 .-. S t t t M 
Then ls{X^A'} t Ml is well-defined. 
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Let bj..b dM = [st Ml and cl..c dM = ls(X^A'J t Ml . 
Then bjfX^lA'l} = c 7 ... b dM {X^lA'l} = c dM 

(row-type in value) 

Assume E\- P R t L and E,xt L,E' hp c: C. 
Then [cfX^RJl is well-defined. 
LetXj..X dL = VarSeq(XjL) and Aj..A dL = 1R t Ll 
Then I clfXj <—A 1 }..{X dL ^A dL } = I c{X*-RJl 

(row-type in row-value) 

Assume E\- P R fL and E,xt L,E' hp s .-. S t M. 
Then ls{X^R} t Ml is well-defined. 
LetXj..X dL = VarSeq(X,tL) and A 7 ..A A = [R t Ll 
Let bj..b dM = Is tMl and c 1 ..c dM = ls{X^Rj f Ml 
Then b^Xj^Ajj.^X^^AgjJ = c t fori in L.dM. 

(value in value) 

Assume E\-pa : A and E,x:A,E' \-pb : B 
Then [b{x<—ajl is well-defined. 
Then [blfx^lalj = lb{x*-a}L 

(value in row-value) 

Assume E\-pa : A and E,x:A,E' hp 5 .-. S ? M 
Then [s{x<—aj t Ml is well-defined. 
Let bj..bgM = I s t Ml and Cj..cg M = ls{x<— a} tMl 
Then b]{x<—ialj = cj ... bj M {x<—lal} = cj M . 

(row-value in value) 

Assume E\-pr .-. R t L and E,x. R t L,E' hp c : C. 
Then Hc{x<—rjl is well-defined. 
Let xj..xj L = varSeq(x, t L) and aj..ag L = lr I Ll 
Then [cl{xj<—ajJ..(xji<—a^iJ = lc{x*—r}l. 

(row-value in row-value) 

Assume E\- P r.-.R tL and E,x..R t L,E' hp s .-. 5 ?M 
Then [s{x<—rj t Ml is well-defined. 
Let Xj..xjl = varSeq(x, ? L) and aj..a^ = [r t Ll 
Let bj..bg M = Is tMl and Cj..cj m = ls{x<-r} tMl 
Then bi{xj<—aj}..{x^<—agjj = ci foriinL.dM. 

Finally we have the soundness theorem, divided into mutual induction groups. 
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Theorem (soundness) 

(1) hp E env => ffhp E envl 
E\- P A type => lE\- P A type] 
E\- P R tL => IE\- P R tLl 

(2) E\-p A <: B => IE hp A <: B] 
E\- P R<.:StL => [E\- P R < S f Ll 

(3) E\- P a:A => iE\- P a:A] 
E\- P r.-.R tL => lEh P r.:R tLl 

(4) E\- P a^a':A => lEVpa^ a' : Ai 
E\-p r <-> r' .: R t L => lEVpr <-* r' .-. R t L] 

Proof 

The proof is by simultaneous induction on the derivations, using the lemmas above in 
the hard cases. □ 



8. Conclusions 

We have defined a calculus of row variables, F K .p, and translated it into a simpler 
calculus with subtyping, F <: . The constraints imposed by the translation have forced us 
into a restricted subset of the features that have been proposed for calculi of extensible 
records, but we can still express many benchmark examples. 

The particular mixture of features chosen for F <: p is not uniquely determined. For 
example we might have attempted to incorporate bounds on row quantifiers 
(V(X<.-.RtL)B), row-valued functions (A—>f L R), or record concatenation (sketched in 
section 5.5). The point is that many possible variations can be described and evaluated 
within a single basic framework. Underlying all these variations and bridging between 
them there is F <: , often extended with recursion. This approach could provide us with a 
fundamental and unified framework in which to study complex features of object-oriented 
languages. 
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